|
|
Accession Number |
TCMCG075C28955 |
gbkey |
CDS |
Protein Id |
XP_007008941.2 |
Location |
join(722943..722993,723130..723260,723490..723574,723852..723905,724071..724162,724277..724380,724700..724775,725216..725672,726318..726653,726773..726982,727102..727200,727300..727372,728278..728405,728513..728807,728888..729102,729186..729310,729460..729577,730121..730176,730299..730392,730477..730666,730747..730815,730895..730998,731079..731141,731292..731399) |
Gene |
LOC18585840 |
GeneID |
18585840 |
Organism |
Theobroma cacao |
|
|
Length |
1110aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA341501 |
db_source |
XM_007008879.2
|
Definition |
PREDICTED: uncharacterized protein LOC18585840 isoform X2 [Theobroma cacao] |
CDS: ATGTTCACCAAATTCTTCCATAACCACGGCGCCTCACCTCAATCTCCAAAGAGTGATGTTGCAAAAGGAAGTTTGACGTCGGCAGATTTAAATCCCCGTGTGACCGTACACTATGGAATTCCGGCAACTGCTTCGGTTCTGGCTTGTGATCTCATCCAACGACTCGTTGCAGTTGGAACGTTGGATGGGAGGATAAAAGTGATCGGTGGTGAAAACATAGAAGCGCTTCTAGTGTCTCCTAAGCAGTTACCTATCAAAAACTTGGAGTTTCTACAAAATCAAGGTTTTCTTGTTAGCGTGTCAAATGAAAATGAGATTCAGGTCTGGGATTTGGAACAAAGGCAAATAGCTTCCCATATACAGTGGGAGTCCAATATAACTGCTTTCAAAGTCATTCATGGCACTAGCTACATGTATCTCGGAGATGAGCATGGGATGGTGTATGTTATAAAGTATGATGCTGAAGAACACAAGCTTGCCCACCTTCCTTATTATGTTCCCACAAATGTTATAGCTGAAGAGGCTGGGATTTCATCACCTAATCATCCTTCTGTTGTTGGAGTTCTTCCCCAACCTTGTTCTCAGGGAAACAGGGTACTGATTGCTTACGAGAATGGGTTGCTTGCCATCTGGGATATTTCTGAAGATCGAGTTGTTCTAGTTAGAGGCAATAAGGATCTCCAATTGAAAGGCAGAACGACATCTGATTCTCCAGAAGAAAAAAAACTTGAAGTTTCTGACTGCACATCAGACGGTGATGAAGTGAAAGAGATAAGCTCTCTTTGTTGGGCATCGAATGATGGGTCAATTCTTGCAGTTGGTTATGTAGATGGGGATATCATGTTTTGGAACTTATCAACTGCTAACCCTAAAAAGATTCAGCAAGCTGAAAAATCACCCAACAATGTTGTTAAATTACAATTATCATCGGGAGAGAAAAGACTTCCTGTTATTGTTTTACATTGGTCTGCAAACCAATCCTGTGGTGATCATGGCTGCAAGCTCTTTGTCTATGGTGGTGATAACGTAGGATCGGAAGAAGTTCTAACGATCTTAAGCCTTGAATGGACTTCTGGAATAGAAAGTCTGAAATGTGTCAGCCGTATGGACCTTACACCCAATGGCTCTTTTGCGGATATGGTTTTGTTACCAACTGTGGGGGTAACAGAGAGTGGTGGCAATTTGCTATTTATGTTGACAAACCCAGGGCAGTTGCATGTTTATGACGATGCATGCTTGGCCGCCTTACTGTCTCAGCAAGAGAAAACAACTTGTGTTTCTTCAGGACAGTATGTTATGCCCATACCCACTGTTGATCCATGCATGACTGTGAGTAAGCTTGCTTTAGTTTACAGAGATGGGGAATTTTCGAAGGCTCTTTCCAAGATAGTATCAGCCACAAAGCTTAAAGCACCACATACTCCAGCTACAGGGAGTAGAAGGTGGCCTTTGACTGGGGGCTTTCCCAGCCTTCTTTCTGAAACTGCAGATTATCAAGTTGAAAGAGTATATGTGGCAGGTTACCAGGACGGATCTGTTCGAATATGGGATGCCACCTATCCAGCTCTTTCACTTATCTTTGCTCTAGGAACTGAGGTGCCAGGTTTTGACGTTGCTGTTGCAAGTGCATCAGTGTCAGCATTGGAAATTTGCTCCTTAACTCAAAGTGTAGCCATTGGCAATGAATGTGGTATGGTTCGTCTCTACAAACTAACAGTAACTTCTGATGAAATGAGTTTGAACATTGTGAAGGAAACAGAGAAAGAAGTCCATACCTTGCACCAAACAGATGGCCCTCAATGCTTGGCTGTGTTTTCACTCCTCAATTCTCCTGTATGTGTGCTACAATTTGCAAAATTTGGTACCAGACTTGCAGTGGGATTTAATTGTGGCAGGGTTGCAATGGTTGATGTTAGTACATTTTCAGTGTTGTTCATTACAGACAGTTTATCACCCTCAAATTGCCCTGTTGGTTTGTCTGCTATGATATCATTTACAGACAACGATACCTTGGTAAACAGCCCAAGGGATTCTGTATCCACAAGTCTGAATGATAACGAAAAGTGGTTAGCATTCGTAATGACCAAGGATGCATACCTAACAGTTTTAGATGGCACAACTGGCAATGTGGTTAGCTCTCTGTCAATACCTCTGAAAGCGGAGTCAAGTGCCATCTCTATGTACATTTTAGAGGGTGGCAATATAGTCTCTACAGTGCCATCAGAGATCAGTGAAACTAAATTTGAACCTGCACATTCTAGTCCTGACCATGGAATTACTCCAGTAGAAGCTAAATCTGAGATTTCCACTCAAGTGGCATACTTTGGGCAGAGATTAAAGAGTTTACTCATTTTACTTTGTTTTGAGGATGCACTGCATTTATGTTCTATGAAGTCAGTGATTCAGGGGACCGCTGACTCCATATGGGCAGTCAATCTTCCGAAGCAATGTTCTTGGACTTCAGCCTTCAAGATAGATGACAAAGAGTGTGGGTTGGTTCTGCTTTACCGGACTGGAGTCCTTGAAATAAGGTCTATGAAAACTCTTGAGGCGATGGGAGAAAGTTCTTTGATGACTATTCTTAGATGGAACTTCAAAACTAACATGGAAAAGATTATATGTTCTTCAAATAGAGGGCAAATTATACTGATACATGGGTGTGAATTTGCTGCTATATCTATTCTGGCCCTTGAGAATGAGTTCAGGATTCCGGATTCTTTGCCATGCATTCATGATACAGTCCTTGCAGCTGCTTTTGATGCAACTGTTAGTTTATCTCCAAGTCAGAATAAAAGCCAGGATACGGCTCCTGGGATTTTAAGTGGTCTTATTAAGGGCTTAAGAGTAGGTAAGCTGGATCAAAATGTGCAAATTCAGGAAGCTTGTAAGAATGATTTCTCGCATTTGGAGAGCATATTTTCTAGTCCCCCATTCTTAAAGCCTTCCATGGCCAGCACAGATTGGCAAGAAGTACTGGATCTTAACATAGATGACATTCAAATTGACGAACCTGTAACCATCTCATCTTCTTCCGAGAAGATCAAGAATGACAGTAAAGAGCAAAGAACAGAGAGAGAAAGATTATTTGAGGGTGCTGGTACTGATGCAAAACCAAGGCTCAGAACAGCCGAGGAAATCAGGGCTAAGTATAGAGGAGCTGAGGATGCTGCAGCTGCAGCTGCAAGTGCCCGGGACAGGCTTGTAGAGCGGCAGGAAAAACTCGAGAGGATCAACGAACGTACTCAAGAGCTACAAAGCGGGGCTGAGAACTTTGCCTCCATGGCAAATGAACTTGCCAAGAGAATGGAAAAGAAAAAGTGGTGGAATCTATGA |
Protein: MFTKFFHNHGASPQSPKSDVAKGSLTSADLNPRVTVHYGIPATASVLACDLIQRLVAVGTLDGRIKVIGGENIEALLVSPKQLPIKNLEFLQNQGFLVSVSNENEIQVWDLEQRQIASHIQWESNITAFKVIHGTSYMYLGDEHGMVYVIKYDAEEHKLAHLPYYVPTNVIAEEAGISSPNHPSVVGVLPQPCSQGNRVLIAYENGLLAIWDISEDRVVLVRGNKDLQLKGRTTSDSPEEKKLEVSDCTSDGDEVKEISSLCWASNDGSILAVGYVDGDIMFWNLSTANPKKIQQAEKSPNNVVKLQLSSGEKRLPVIVLHWSANQSCGDHGCKLFVYGGDNVGSEEVLTILSLEWTSGIESLKCVSRMDLTPNGSFADMVLLPTVGVTESGGNLLFMLTNPGQLHVYDDACLAALLSQQEKTTCVSSGQYVMPIPTVDPCMTVSKLALVYRDGEFSKALSKIVSATKLKAPHTPATGSRRWPLTGGFPSLLSETADYQVERVYVAGYQDGSVRIWDATYPALSLIFALGTEVPGFDVAVASASVSALEICSLTQSVAIGNECGMVRLYKLTVTSDEMSLNIVKETEKEVHTLHQTDGPQCLAVFSLLNSPVCVLQFAKFGTRLAVGFNCGRVAMVDVSTFSVLFITDSLSPSNCPVGLSAMISFTDNDTLVNSPRDSVSTSLNDNEKWLAFVMTKDAYLTVLDGTTGNVVSSLSIPLKAESSAISMYILEGGNIVSTVPSEISETKFEPAHSSPDHGITPVEAKSEISTQVAYFGQRLKSLLILLCFEDALHLCSMKSVIQGTADSIWAVNLPKQCSWTSAFKIDDKECGLVLLYRTGVLEIRSMKTLEAMGESSLMTILRWNFKTNMEKIICSSNRGQIILIHGCEFAAISILALENEFRIPDSLPCIHDTVLAAAFDATVSLSPSQNKSQDTAPGILSGLIKGLRVGKLDQNVQIQEACKNDFSHLESIFSSPPFLKPSMASTDWQEVLDLNIDDIQIDEPVTISSSSEKIKNDSKEQRTERERLFEGAGTDAKPRLRTAEEIRAKYRGAEDAAAAAASARDRLVERQEKLERINERTQELQSGAENFASMANELAKRMEKKKWWNL |